Tangerine: a large vocabulary Mandarin dictation system
Abstract
Text input for non-alphabetic languages such as Chinese has been a decades-long problem. Chinese dictation using large-vocabulary speech recognition provides a convenient mode of text entry. In contrast to a character-based dictation system [5], a word-based Mandarin dictation system has been designed [3] (based on Apple's PlainTalk speech recognition technology [4]) for efficient entry of Chinese characters into a computer. This paper presents new features and improvements to the dictation system, which together have produced an overall reduction in recognition error of 50 to 80%. The vocabulary has also been increased from 5000 words to over 11,000 words. The new features are: mel-frequency cepstral analysis, spectral noise subtraction, cepstral mean normalisation, HMM-based tone classification, training data reduction, adaptive training, more detailed sub-syllable modeling, and a statistical language model.
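Two of the front-end features named in the abstract, cepstral mean normalisation and spectral noise subtraction, can be illustrated with a minimal sketch. The abstract does not describe how Tangerine implements them, so the function names, the per-utterance mean, and the `floor` parameter below are assumptions chosen for illustration, following the textbook form of each technique.

```python
from typing import List


def cepstral_mean_normalise(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-utterance mean of each cepstral coefficient.

    `frames` is a list of cepstral vectors, one per analysis frame.
    Assumption: the mean is taken over the whole utterance.
    """
    if not frames:
        return []
    n_coeffs = len(frames[0])
    means = [sum(f[i] for f in frames) / len(frames) for i in range(n_coeffs)]
    return [[f[i] - means[i] for i in range(n_coeffs)] for f in frames]


def spectral_subtract(power: List[float], noise: List[float],
                      floor: float = 0.01) -> List[float]:
    """Subtract a noise power estimate from each spectral bin.

    Each result is floored at `floor * power` so that no bin goes negative.
    Assumption: `noise` was estimated elsewhere, e.g. from non-speech frames.
    """
    return [max(p - n, floor * p) for p, n in zip(power, noise)]
```

Removing the cepstral mean cancels a fixed channel or microphone effect, which appears as a constant offset in the cepstral domain. The spectral floor is a common guard against negative power values after subtraction.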
Similar resources
Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data
This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accurac...
Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition
This paper presents a new approach to tone modeling for continuous Mandarin speech recognition. Mandarin tones provide rich information for speech recognition. In this paper, we treat the tone as an attribute of the final vowel part of a Mandarin syllable. Separate distributions are estimated for cepstral coefficients and pitch features respectively, and the phonetic state tied-mixture techniqu...
Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary
This paper describes the first successfully implemented real-time Mandarin dictation machine developed in the world which recognizes Mandarin speech with very large vocabulary and almost unlimited texts for the input of Chinese characters into computers. Considering the special characteristics of the Chinese language, syllables are chosen as the basic units for dictation. The machine is ...
A multi-pass error detection and correction framework for Mandarin LVCSR
We previously proposed a multi-pass framework for Large Vocabulary Continuous Speech Recognition (LVCSR). The objective of this framework is to apply sophisticated linguistic models for recognition, while maintaining a balance between complexity and efficiency. The framework is composed of three passes: initial recognition, error detection and error correction. This paper presents and evaluates...
A Survey on Automatic Speech Recognition with an Illustrative Example on Continuous Speech Recognition of Mandarin
For the past two decades, research in speech recognition has been intensively carried out worldwide, spurred on by advances in signal processing, algorithms, architectures, and hardware. Speech recognition systems have been developed for a wide variety of applications, ranging from small vocabulary keyword recognition over dial-up telephone lines, to medium size vocabulary voice interactive com...
Publication date: 1995